Search results for "Text Simplification"
showing 10 items of 13 documents
Attention-based Model for Evaluating the Complexity of Sentences in English Language
2020
The automation of text complexity evaluation (ATCE) is an emerging problem which has been tackled by means of different methodologies. We present an effective deep learning- based solution which leverages both Recurrent Neural and the Attention mechanism. The developed system is capable of classifying sentences written in the English language by analysing their syntactical and lexical complexity. An accurate test phase has been carried out, and the system has been compared with a baseline tool based on the Support Vector Machine. This paper represents an extension of a previous deep learning model, which allows showing the suitability of Neural Networks to evaluate sentence complexity in tw…
A Neural Network model for the Evaluation of Text Complexity in Italian Language: a Representation Point of View
2018
Abstract The goal of a text simplification system (TS) is to create a new text suited to the characteristics of a reader, with the final goal of making it more understandable.The building of an Automatic Text Simplification System (ATS) cannot be separated from a correct evaluation of the text complexity. In fact the ATS must be capable of understanding if a text should be simplified for the target reader or not. In a previous work we have presented a model capable of classifying Italian sentences based on their complexity level. Our model is a Long Short Term Memory (LSTM) Neural Network capable of learning the features of easy-to-read and complex-to-read sentences autonomously from a anno…
A sentence based system for measuring syntax complexity using a recurrent deep neural network
2018
In this paper we present a deep neural network model capable of inducing the rules that identify the syntax complexity of an Italian sentence. Our system, beyond the ability of choosing if a sentence needs of simplification, gives a score that represent the confidence of the model during the process of decision making which could be representative of the sentence complexity. Experiments have been carried out on one public corpus created specifically for the problem of text-simplification.
Design, development and validation of a system for automatic help to medical text understanding
2020
Abstract Objective The paper presents a web-based application, SIMPLE, that facilitates medical text comprehension by identifying the health-related terms of a medical text and providing the corresponding consumer terms and explanations. Background The comprehension of a medical text is often a difficult task for laypeople because it requires semantic abilities that can differ from a person to another, depending on his/her health-literacy level. Some systems have been developed for facilitating the comprehension of medical texts through text simplification, either syntactical or lexical. The ones dealing with lexical simplification usually replace the original text and do not provide additi…
Extracting Semantic Knowledge from Unstructured Text Using Embedded Controlled Language
2016
Nowadays, most of the data on the Web is still in the form of unstructured text. Knowledge extraction from unstructured text is highly desirable but extremely challenging due to the inherent ambiguity of natural language. In this article, we present an architecture of an information extraction system based on the concept of Embedded Controlled Language that allows for extracting formal semantic knowledge from an unstructured text corpus. Moreover, the presented approach has a potential to support multilingual input and output.
Multi-class Text Complexity Evaluation via Deep Neural Networks
2019
Automatic Text Complexity Evaluation (ATE) is a natural language processing task which aims to assess texts difficulty taking into account many facets related to complexity. A large number of papers tackle the problem of ATE by means of machine learning algorithms in order to classify texts into complex or simple classes. In this paper, we try to go beyond the methodologies presented so far by introducing a preliminary system based on a deep neural network model whose objective is to classify sentences into more of two classes. Experiments have been carried out on a manually annotated corpus which has been preprocessed in order to make it suitable for the scope of the paper. The results sho…
DeepEva: A deep neural network architecture for assessing sentence complexity in Italian and English languages
2021
Abstract Automatic Text Complexity Evaluation (ATE) is a research field that aims at creating new methodologies to make autonomous the process of the text complexity evaluation, that is the study of the text-linguistic features (e.g., lexical, syntactical, morphological) to measure the grade of comprehensibility of a text. ATE can affect positively several different contexts such as Finance, Health, and Education. Moreover, it can support the research on Automatic Text Simplification (ATS), a research area that deals with the study of new methods for transforming a text by changing its lexicon and structure to meet specific reader needs. In this paper, we illustrate an ATE approach named De…
Machine Learning Models for Measuring Syntax Complexity of English Text
2019
In this paper we propose a methodology to assess the syntax complexity of a sentence representing it as sequence of parts-of-speech and comparing Recurrent Neural Networks and Support Vector Machine. We have carried out experiments in English language which are compared with previous results obtained for the Italian one.
Towards text simplification for poor readers with intellectual disability: When do connectives enhance text cohesion?
2013
Abstract Cohesive elements of texts such as connectives (e.g., but, in contrast) are expected to facilitate inferential comprehension in poor readers. Two experiments tested this prediction in poor readers with intellectual disability (ID) by: (a) comparing literal and inferential text comprehension of texts with and without connectives and/or high frequency content words (Experiment 1) and (b) exploring the effects of type and familiarity of connectives on two-clause text comprehension by means of a cloze task (Experiment 2). Neither the addition of high frequency content words nor connectives in general produced inferential comprehension improvements. However, although readers with ID wer…
A recurrent deep neural network model to measure sentence complexity for the Italian Language
2019
Text simplification (TS) is a natural language processing task devoted to the modification of a text in such a way that the grammar and structure of the phrases is greatly simplified, preserving the underlying meaning and information contents. In this paper we give a contribution to the TS field presenting a deep neural network model able to detect the complexity of italian sentences. In particular, the system gives a score to an input text that identifies the confidence level during the decision making process and that could be interpreted as a measure of the sentence complexity. Experiments have been carried out on one public corpus of Italian texts created specifically for the task of TS…